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CONTROLLING DATA DELIVERY 
BACKGROUND 

[0001] Computer and other data processing systems often need to transfer data, 
usually in the form of data words, between various parts of the system as fast as possible to 
reduce latency and increase system performance. Some data can arrive at a given location 
within the system at unpredictable times, whereas other data, from another location within the 
system, can arrive more predictably. Different arrival times lead to gaps between the data 
words, and require the data words to be buffered and, in some cases, put into a predetermined 
sequence before they are passed to the given location. Buffering and sequencing the data can 
increase the time it takes the data to move from one location within the system to another and 
contributes to latency. The performance of such systems can be optimized if the data words 
are transferred at times and in a sequence that optimize a selected system parameter such as 
latency, bandwidth, or safety, rather than simply buffering the data words and then 
transferring them at one time. 

SUMMARY 

[0002] In its broad aspects, the invention encompasses a method of delivering 
data from a data input to a data output within a system, and comprises selecting a system 
performance parameter to be optimized, receiving at the data input a sequence of discrete 
data words, determining an optimum mode of delivery of the data words to the data output so 
as to optimize the selected performance parameter, and delivering the data words from the 
data input to the data output in the determined optimum mode. The optimum mode of 
delivery may include at least one of an optimum time and sequence of delivery of the data 
words. 

[0003] In one embodiment, the invention encompasses a method of delivering 
data from a data input to a data output within a system, comprising receiving at the data input 
a sequence of discrete data words, and delivering each data word to the data output as soon as 
possible after the data word is received at the data input in order to minimize latency. The 
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data words may also, if desired, be arranged into a preselected sequence before being 
delivered to the data output. 

[0004] In another embodiment, the invention encompasses a method of delivering 
data from a data input to a data output within a system, including receiving at the data input a 
sequence of discrete data words comprising a data packet, holding at least one of the data 
words first received at the data input in a queue until additional data words comprising the 
data packet are received at the data input, delivering the data words in the queue from the 
queue to the data output as the additional data words comprising the data packet are received 
at the data input with minimum time gaps between said data words and delivering the 
additional data words substantially directly from the data input to the data output as soon as 
possible after the additional data words are received at the data input to maximize bandwidth. 
The data words may also, if desired, be arranged into a preselected sequence before being 
delivered to the data output. 

[0005] In yet another embodiment, the invention encompasses a method of 
delivering data from a data input to a data output within a system, and comprises the steps of 
receiving at the data input a sequence of discrete data words, holding each of the data words 
received in storage until all data words comprising the data packet have been received, and 
delivering the data words from storage to the data output with minimum time gaps between 
the data words to maximize safety. The data words may also, if desired, be arranged into a 
preselected sequence before being sent from storage to the data output. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0006] To illustrate the invention, there is shown in the drawings a form presently 
preferred; it being understood, however, that the invention is not limited to the precise 
arrangements and instrumentalities shown. 

[0007] Figure 1 illustrates in simplified form a block diagram of a system 
according to an embodiment of the invention and through which the method of the invention 
may be carried out. 

[0008] Figures 2 and 3 illustrate the timing and sequence of delivery of data 
according to an embodiment of the invention. 
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DETAILED DESCRIPTION 

[0009] Referring to the drawings, wherein like numerals indicate like elements, 
there is illustrated in Figure 1 a system 10 that receives and processes data. In the illustrated 
embodiment, the system 10 includes a processor 12 in data communication with a data bus 
14. Data bus 14 typically connects processor 12 to other system resources, such as main 
processing units (MPUs), central agent controllers, input/output (I/O) processing agents, and 
memory units. In the context of the invention, "processor" includes, but is not limited to, 
central processing units (CPUs) and I/O processing units. 

[0010] Processor 12 receives data from bus 14, performs operations on the data, 
and outputs data to bus 14. When processor 12 requires data from a source external to it, it 
sends a request for data via link output 16. ^Data being sent to processor 12, including data 
requested by processor 12, is received at link input 1 8. Typically, data received and output 
by processor 12 are in the form of individual data words, which may be grouped into data 
packets. For example, a data packet may consist of four individual data words, designated 
arbitrarily as A, B, C, and D, in a predetermined sequence, and a header (the portion of a 
packet that precedes the actual data and that contains source and destination addresses, error 
checking, and other fields), designated arbitrarily as H. 

[0011] Data passes from link input 1 8 to a synchronizer 20. Synchronizer 20 
synchronizes data between different clock domains. For example, link input 1 8 may be 
operating at a clock frequency of 300 MHz, whereas other system components downstream 
from link input 1 8 may be running at a different clock frequency, for example, 200 MHz. 
Synchronizer 20 can insert or remove gaps between data words based on clock frequency, so 
that data arriving at link input 1 8 is sent downstream at the proper times for downstream 
components. Of course, if all system components operate at the same clock frequency, 
synchronizer 20 becomes unnecessary and can be omitted. Synchronized data from 
synchronizer 20 next move to queue 22. A queue is a first-in, first-out data structure used to 
store the data words. Data words are added to the "tail" of the queue and taken off the 
"head." 

[0012] Data words may also be placed into queue 22 by memory unit 24. Data 
required by processor 12 may, in many cases, be data stored in a memory unit such as 
memory unit 24, rather than data that must be requested from other parts of system 10 via 
link output 16. Although for simplicity a single queue 22 is illustrated, in practice there may 
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be one queue to receive data from synchronizer 20 and a separate queue to receive data from 
memory unit 24. 



[0013] In Figure 1, connections between individual system elements are shown 
using a single line for simplicity of presentation. However, in many cases, each line in reality 
will consist of multiple wires, and each wire will carry multiple data channels. The invention 
is not limited to a particular number of individual wires or channels. The data channels are 
combined on each line using time-division multiplexing (TDM). As those skilled in the art 
will understand, TDM assigns specific time intervals to data in individual channels carried on 
a line, so that for any specific interval only data for a specific individual channel is present on 
the line. In the next interval, only data for a different individual channel is present on the 
line. Queue 22 is used to separate data into individual channels by clocking out data words at 
the time intervals for their respective channels. 

[0014] Data words taken off the head of queue 22 may go directly to a multiplexer 
26 or to another queue 28. Data is sent to queue 28 and then to processor 30 when special 
packet processing by processor 30 is required. Special packet processing may include error 
correction and data formatting, if necessary. For example, if only half a data word is valid, 
special packet processing might include duplicating data. After special packet processing, the 
data from processor 30 move to multiplexer 26. Multiplexer 26 combines data from queue 22 
and processor 30 into a single TDM data stream, which is sent to synchronizer 32. 
Synchronizer 32, like synchronizer 20, is provided to accommodate differences in clock 
frequency between components upstream from synchronizer 32 and components downstream 
from synchronizer 32, including storage unit 34 and multiplexer 36, described below, and bus 
14. However, if all of the system components run at the same clock frequency, synchronizer 
32 becomes unnecessary and can be omitted. 

[0015] Synchronized data from synchronizer 32 then moves either to storage unit 
34 or to multiplexer 36. Storage unit 34 may be a latch array or a register array, so that data 
words can be read out in any order, including an order different from the order in which the 
data words were written into storage unit 34. Multiplexer 36 combines data from storage unit 
34 and synchronizer 32 into a single TDM data stream, which is sent to bus 14 and, in turn, 
processor 12. 
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[0016] The system illustrated in Figure 1 allows data to be delivered from a data 
source, such as link input 1 8 or memory unit 24, to processor 1 2 in one of three different 
modes, each of which allows the optimization of a different system performance parameter. 

ASAP Mode 

[0017] The ASAP mode delivers data from a source, such as link input 1 8 or 
memory unit 24, to processor 12 as soon as possible. The system performance parameter 
optimized in the ASAP mode is latency. Latency is greatly reduced by delivering data as 
soon as possible. In the ASAP mode, data words that are part of a data packet received at 
link input 1 8 move along a path from link input 1 8 to synchronizer 20, from synchronizer 20 
to queue 22, from queue 22 to multiplexer 26, from multiplexer 26 to synchronizer 32, from 
synchronizer 32 to multiplexer 36, and from multiplexer 36 to bus 14 and processor 12. This 
is the most direct path from link input 18 to processor 12. Alternatively, or additionally, data 
from memory unit 24 are read out sequentially and sent to queue 22, from queue 22 to 
multiplexer 26, from multiplexer 26 to synchronizer 32, from synchronizer 32 to multiplexer 
36, and from multiplexer 36 to bus 14 and processor 12. In the ASAP mode, data words are 
delivered to multiplexer 36 as soon as possible after they are received. In the ASAP mode, 
no provision is made for dealing with time gaps between the data words. 

ASAP-BW Mode 

[0018] Although the ASAP mode optimizes latency, it may not be optimal from 
the point of view of bandwidth. In the ASAP mode, there may be long time gaps between 
data words received at link input 1 8. Long time gaps between data words means idle data bus 
time, which contributes to reduced system bandwidth. The ASAP-BW mode attempts to 
optimize bandwidth, while still reducing latency. In the ASAP-BW mode, data words are 
held for a time and then delivered together, with no gaps, unless there is a very large gap 
between data words, such as might occur during a retry. 

[0019] Data arrival times are typically known, based on the source of the data 
words. From known arrival times, the system sends the data words such that the last word to 
arrive is sent immediately from synchronizer 32 through multiplexer 36 and onto bus 14. The 
last data word is always sent as soon as it is received. By knowing the expected, or typical, 
way the data words will arrive, the arrival of the last data word can be anticipated and 
delivery of the data can be optimized for sending the last data word immediately upon its 
arrival. 
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[0020] At least one of the data words first received at link input 1 8, typically the 
first data word in a data packet, is held for a time in storage unit 34 until additional data 
words in the data packet arrive at link input 18. Thus, the first data word follows a path from 
link input 18 to synchronizer 20, to queue 22, to multiplexer 26, to synchronizer 32, and to 
storage unit 34. There, the first data word is held until additional data words in the data 
packet are received at synchronizer 32. As the additional data words are received, the data 
word or data words in the data packet that are being held in storage unit 34 are clocked out to 
multiplexer 36, and onto bus 14 for delivery to processor 12. The additional data words 
received at link input 18 follow a path from link input 18 to synchronizer 20, to queue 22, to 
multiplexer 26, to synchronizer 32, and from there either directly to multiplexer 36 and onto 
bus 14 for delivery to processor 12, or to storage unit 34 and then to multiplexer 36. 

[0021] In the AS AP-BW mode, the data words first received at link input 1 8 are 
held in storage unit 34 only as long as it takes for the remaining data words in the data packet 
to arrive at synchronizer 32. Although this delays the delivery of the first received data 
words, the delay is only as long as necessary to anticipate the arrival of all of the data words 
in the data packet and eliminate the time gaps between them before delivering all of the data 
words together, with minimum time gaps, to processor 12. Eliminating the time gaps 
between words reduces idle processing time in processor 12 and thus optimizes bandwidth. 
As additional data words arrive at link input 1 8, the additional data words are delivered as 
soon as possible after they are received. This permits at least some reduction in latency, 
compared to holding all of the data words before delivering them to processor 12. Thus, 
while the system performance parameter optimized by the ASAP-BW mode is bandwidth, the 
ASAP-BW mode also offers improvement in latency. 

Safety Mode 

[0022] The Safety mode, which may also be thought of as a "no idle" mode, 
always delivers data words in a data packet together, with substantially no time gaps, to 
optimize system safety. By "safety" is meant a reduced risk of problems in processing the 
data words, such as in a CPU. Most systems work fine when data words comprising a data 
packet are sent together. Many will also work even with embedded idle cycles in the data 
packet delivered to the CPU. However, in some cases, for example if there is a bug in the 
CPU's logic or it cannot accept embedded idle cycles under certain conditions, processing 
problems can arise. Holding each of the data words received in storage until all data words 
comprising the data packet have been received and delivering the data words from storage to 
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the data output with substantially no time gaps between the data words minimizes processing 
problems and, conversely, maximizes "safety" of the processing operations. 

[0023] In the Safety mode, all data words comprising a data packet are held in 
storage unit 34 until all of the data words are received. Data words follow a path from link 
input 18, to synchronizer 20, to queue 22, to multiplexer 26, to synchronizer 32, and to 
storage unit 34 where they are held until all of the individual data words comprising a data 
packet are received. Once all of the individual data words are received in storage unit 34, 
they are then sent to multiplexer 36 and from there to bus 14 and processor 12. The Safety 
mode differs from the other modes of operation in that all data words must be received before 
they are sent to processor 12 in order to eliminate time gaps between individual words. 

[0024] Different modes can be assigned to data from different sources 
simultaneously. That is, data packets from one source may be delivered in one mode, while 
data packets from a different source are delivered in a different mode. For example, data 
from memory unit 24 may be processed in the ASAP mode, while data from link input 1 8 
may be processed in the Safety mode. 

[0025] Because multiple data channels can be time-division multiplexed onto a 
single line, it is possible to have different data channels operating in different modes 
simultaneously. For example, data may be delivered on a first channel in the ASAP mode, 
while at the same time data on a second channel are delivered in the Safety mode. Any 
reasonable and practical number of channels can be multiplexed onto a single line without 
departing from the scope of the invention. 

Critical Word Order 

[0026] In some cases, the sequence of individual data words comprising a data 
packet is not critical to proper operation of the system. In those cases, latency is improved in 
all modes by delivering data words in the same sequence in which they are received, without 
reordering them. 

[0027] However, in practice, word order within a data packet can be critical. For 
example, with some bus standards, data words within a data packet must be returned in a 
specific order, depending on how a data packet is requested by processor 12. In one known 
standard, for example, four critical word orders are used, designated as Order 0, in which 
critical word order is ABCD, Order 2, in which critical word order is BADC, Order 4, in 
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which critical word order is CDAB, and Order 6, in which critical word order is DCBA. In 
all modes according to the present invention, the individual data words comprising a data 
packet can be put into a preselected sequence before they are sent from the storage unit 34 to 
multiplexer 36 and bus 14. Critical word order within a data packet can be achieved in all 
modes by holding some of the data words in storage unit 34 and clocking them out of storage 
unit 34 in a desired sequence. 

[0028] Examples of how data words are delivered in desired sequences are 
illustrated in Figures 2 and 3. Referring first to Figure 2, data for a single channel in a two- 
channel system are shown on the top line, with individual data words being shown within 
discrete TDM intervals numbered 1 through 16. In Figure 2, data words A, B, C, and D are 
illustrated as being present, in that order, at the output of synchronizer 32, in TDM intervals 
1, 3, 5, and 7, respectively. It will be recognized that the data words at the output of 
synchronizer are in critical word Order 0. Each of the data words is in a discrete TDM 
interval, and the data words are separated by one interval. The "blank" intervals 2, 4, and 6 
separating the data words A, B, C, and D in Figure 2 represent TDM intervals for the other 
channel in the two-channel system illustrated. 

[0029] In the ASAP mode, individual data words are delivered to bus 14 via 
multiplexer 36 as soon as they appear at the output of synchronizer 32. For critical word 
Order 0, no reordering of the data words is necessary, and the data words are simply passed 
directly to multiplexer 36. The individual data words are not delayed but are passed to 
multiplexer 36 in the same TDM interval in which they appear at the output of synchronizer 
32. 

[0030] For critical word Order 2, however, the data words must be reordered into 
BADC, so even though data word A is first to appear, it cannot be sent until after data word B 
appears. In this case, data word A is sent to storage unit 34 until it is needed. As soon as 
data word B appears at the output of synchronizer 32, it is sent to multiplexer 36, followed 
immediately by data word A, the next data word in Order 2, which is clocked out of storage 
unit 34 in the next interval. The third data word in Order 2 is D, so data word C, when it 
appears at the output of synchronizer 32, must be stored in storage unit 34 until after data 
word D is received. Thus, there is a delay of two intervals until D appears at the output of 
synchronizer 32. As soon as it does, it is sent immediately to multiplexer 36, followed by 
data word C, which is clocked out of storage unit 34 in the next interval. 



PHIP\368323\1 
200313791-1 



8 



DBR# 192664 



[0031] Similarly, for critical word Order 4 (CDAB), data words A and B are sent 
to storage unit 34 until after data word D appears at the output of synchronizer 32. Data word 
C is sent as soon as it appears, followed by data word D when it appears two intervals later, 
followed immediately by data words A and B, which are clocked out of storage unit 34 in the 
next two succeeding intervals. For Order 6, data words A, B, and C are all sent to storage 
unit until data word D appears. As soon as D appears, it is sent to multiplexer 36, followed 
by data words C, B, and A, which are clocked out of storage unit 34 in that order in the next 
three succeeding intervals. 

[0032] In the ASAP-BW mode, the goal is to eliminate time gaps between data 
words. In the example illustrated in Figure 2, it is expected that data words A, B, C, and D 
will appear at the output of synchronizer 32 in that order, in TDM intervals 1, 3, 5, and 7. It 
will be recognized that the data words will appear in critical word Order 0. For Order 0, 
reordering of the data words is not necessary, but in the ASAP-BW mode the time gaps 
between the data words need to be eliminated. Because it is anticipated that data word D will 
appear at the output of synchronizer 32 in TDM interval 7, data words A, B, and C are stored 
in storage unit 34 as they appear, and are clocked out to multiplexer 36 in that order 
beginning with TDM interval 4. That is, data word A is clocked out in interval 4, data word 
B is clocked out in interval 5, and data word C is clocked out in interval 6, in anticipation of 
data word D appearing in interval 7. Data word D is sent directly to multiplexer 36 as soon 
as data word D appears at the output of synchronizer 32. Thus, data words A, B, C, and D 
are sent to multiplexer 36 in Order 0 in intervals 4, 5, 6, and 7, without time gaps between 
them. 

[0033] Similarly, for Order 2 (BADC), data words A, B, and C are stored in 
storage unit 34 as they appear at the output of synchronizer 32. Assuming that data word D 
will appear in interval 7, data words B and A are clocked out of storage unit 34 in that order 
in intervals 5 and 6, data word D is sent in interval 7, and data word C is clocked out of 
storage unit 34 in interval 8. Thus, data words B, A, D, and C are sent to multiplexer 36 in 
Order 2 in intervals 5, 6, 7, and 8, without time gaps between them. The same procedure is 
followed for Order 4 and Order 6, so that for Order 4 data words C, D, A, and B are sent to 
multiplexer 36 in intervals 6, 7, 8, and 9, and for Order 6 data words D, C, B, and A are sent 
in intervals 7, 8, 9, 10, all without time gaps between the data words. 

[0034] In the Safety mode, the goal is to wait until all data words have been 
received, and to send all of the data words in the desired Order without gaps between them. 
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Thus, as illustrated in Figure 2, individual data words are held in storage unit 34 until data 
word D is received in interval 7. At that time, the data words are clocked out of storage unit 
34 beginning with interval 7, in desired critical word Order, without gaps between them. 

[0035] The invention handles unpredictable gaps in data packets including large 
gaps (caused, for example, by retries of sending link data), where others cannot. Referring to 
Figure 3, which is similar to Figure 2, a data packet comprising data words A, B, C, and D is 
illustrated, with a three-interval gap between data words B and C. (Although a three-interval 
gap is illustrated, the invention is capable of handling much larger gaps.) Individual data 
words for the three modes and four critical word Orders are processed in the same manner as 
illustrated in Figure 2, with the exception of delaying the data because of the gap between 
data words B and C. Thus, data delivery is delayed by two additional intervals compared to 
the examples illustrated in Figure 2, but is otherwise delivered in the same manner. 

[0036] It should be understood that the foregoing examples are intended as 
illustrative and not as limiting. For example, practical considerations including design 
constraints, bus protocol timing, and other factors may result in slight variations from the 
specific examples shown, without departing from the invention. 

[0037] The invention reduces latency by delivering the data as soon as it is 
received. The invention also reduces latency by optimizing processor recalls and memory 
storage to return data in the desired critical data word order to eliminate delays caused by 
reordering the data words. In all modes, latency can be reduced even while a different system 
parameter is optimized by having all system elements forward data to the next element in the 
data path as soon as possible after receipt. 

[0038] Although the present invention and its advantages have been described in 
detail, it should be understood that various changes, substitutions, and alterations can be 
made without departing from the spirit and scope of the invention as defined by the appended 
claims. Moreover, the scope of the present application is not intended to be limited to the 
particular embodiments of invention described in the specification. As one of ordinary skill 
in the art will readily appreciate from the foregoing description, processes, machines, articles 
of manufacture, compositions of matter, means, methods, or steps presently existing or later 
to be developed that perform substantially the same function or achieve substantially the 
same result as the corresponding embodiments described herein may be utilized to implement 
and carry out the present invention. Accordingly, the appended claims are intended to 
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include within their scope such processes, machines, articles of manufacture, compositions of 
matter, means, methods, or steps. 

[0039] The foregoing describes the invention in terms of embodiments foreseen 
by the inventors for which an enabling description was available, notwithstanding that 
insubstantial modifications of the invention, not presently foreseen, may nonetheless 
represent equivalents thereto. 
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